BD141394
LOCUS       BD141394                 834 bp    DNA     linear   PAT 18-SEP-2002
DEFINITION  Novel carbonyl reductase, gene thereof and method of using the
            same.
ACCESSION   BD141394
VERSION     BD141394.1  GI:23236339
KEYWORDS    WO 0210399-A/1.
SOURCE      Micrococcus luteus
  ORGANISM  Micrococcus luteus
            Bacteria; Actinobacteria; Actinobacteridae; Actinomycetales;
            Micrococcineae; Micrococcaceae; Micrococcus.
REFERENCE   1  (bases 1 to 834)
  AUTHORS   Kizaki,N., Yasohara,Y. and Hasegawa,J.
  TITLE     Novel carbonyl reductase, gene thereof and method of using the same
  JOURNAL   Patent: WO 0210399-A 1 07-FEB-2002;
            KANEKA CORP,NORIYUKI KIZAKI,YOSHIHIKO YASOHARA,JUNZO HASEGAWA
COMMENT     OS   Micrococcus luteus
            PN   WO 0210399-A/1
            PD   07-FEB-2002
            PF   01-AUG-2001 WO 2001JP006619
            PR   01-AUG-2000 JP 00P 232756
            PI   NORIYUKI KIZAKI,YOSHIHIKO YASOHARA,JUNZO HASEGAWA
            PC   C12N15/53,C12N9/02,C12N1/21,C12P17/10//(C12N15/53,C12R1:265)
            CC   Novel carbonyl reductase, gene thereof and method of using the
            CC   same.
            FH   Key             Location/Qualifiers
            FT   source          1..834
            FT                   /organism='Micrococcus luteus',
FEATURES             Location/Qualifiers
     source          1..834
                     /organism="Micrococcus luteus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:1270"
ORIGIN


Query Match 59.1%; Score 834; DB 6; Length 834; 

Best Local Similarity 100.0%; Pred. No. 5.8e-88; 

Matches 834; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy        108 ATGCGACGGATGACGCTGCCGAGTGGGGAGTCCATCCCTGTGCTGGGCCAGGGCACCTGG 167
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGCGACGGATGACGCTGCCGAGTGGGGAGTCCATCCCTGTGCTGGGCCAGGGCACCTGG 60

Qy        168 GGCTGGGGTGAGGACCCCGGCCGCCGCGGCGACGAGGTCGCCGCGCTGCACGCCGGCCTC 227
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GGCTGGGGTGAGGACCCCGGCCGCCGCGGCGACGAGGTCGCCGCGCTGCACGCCGGCCTC 120

Qy        228 GAGCTGGGCATGACGCTGGTCGACACCGCCGAGATGTACGCCGACGGCGGTGCGGAGGAG 287
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GAGCTGGGCATGACGCTGGTCGACACCGCCGAGATGTACGCCGACGGCGGTGCGGAGGAG 180

Qy        288 GTGGCTGGTGAAGCATTGGCGGGTCGCCGCGACGAGGCGTTCGTGGTCAGCAAGGTCATG 347
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GTGGCTGGTGAAGCATTGGCGGGTCGCCGCGACGAGGCGTTCGTGGTCAGCAAGGTCATG 240

Qy        348 CCGTCCCACGCCTCCCGTTCCGGCACGATCGCGGCCTGCGAACGCAGCCTGAAACGCCTG 407
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 CCGTCCCACGCCTCCCGTTCCGGCACGATCGCGGCCTGCGAACGCAGCCTGAAACGCCTG 300

Qy        408 GGCACCGATCGGATCGACCTCTACCTGCTGCACTGGCAGGGCAGGTACCCGCTGCAGGAC 467
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 GGCACCGATCGGATCGACCTCTACCTGCTGCACTGGCAGGGCAGGTACCCGCTGCAGGAC 360

Qy        468 ACCGTCGCGGCCTTCCACCAGCTCGTCGAGGACGGGAAAATCCGATACTGGGGCGTCAGC 527
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCGTCGCGGCCTTCCACCAGCTCGTCGAGGACGGGAAAATCCGATACTGGGGCGTCAGC 420

Qy        528 AACTTCGACCACCGGGCCCTCGCCGAGCTGCAGGACGTGCCGGGCACCAGCGGGCTGACC 587
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 AACTTCGACCACCGGGCCCTCGCCGAGCTGCAGGACGTGCCGGGCACCAGCGGGCTGACC 480

Qy        588 ACGGATCAGGTGCTGTACAACCTGTCGCGGCGAGGACCGGAGTACGACCTGCTGCCGTGG 647
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 ACGGATCAGGTGCTGTACAACCTGTCGCGGCGAGGACCGGAGTACGACCTGCTGCCGTGG 540

Qy        648 TGCGCCGACCACCAGCTGCCGGTCATGGCGTACTCGCCGATCGAGCAGGGCCGCATCCTT 707
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TGCGCCGACCACCAGCTGCCGGTCATGGCGTACTCGCCGATCGAGCAGGGCCGCATCCTT 600

Qy        708 GACGACACGACGCTGAACGACGTCGCGGCCCGTCACAGCGTCAGCCCCGCGGCGGCGGCC 767
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 GACGACACGACGCTGAACGACGTCGCGGCCCGTCACAGCGTCAGCCCCGCGGCGGCGGCC 660

Qy        768 CTTGCCTGGGTGCTGCGCCGCGACTCGCTCTGCACGATCCCCAAGGCGAGCAGCCCGCAG 827
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CTTGCCTGGGTGCTGCGCCGCGACTCGCTCTGCACGATCCCCAAGGCGAGCAGCCCGCAG 720

Qy        828 CACGTGCGCGACAACGCCACAGCACTGGACGTGGAGCTGACCCGCGAAGACCTGGATGCT 887
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CACGTGCGCGACAACGCCACAGCACTGGACGTGGAGCTGACCCGCGAAGACCTGGATGCT 780

Qy        888 CTGGACCGTGCGTTTCCGCCCCCGAGCGGACCGCGACCACTGGAAATGCTGTGA 941
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 CTGGACCGTGCGTTTCCGCCCCCGAGCGGACCGCGACCACTGGAAATGCTGTGA 834
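
As a cross-check on the alignment above, the database sequence (bases 1 to 834) can be translated codon by codon and compared with the protein query used in the later alignments. The following is a minimal Python sketch, assuming the standard genetic code applies; the helper name `translate` is hypothetical and is not part of the search report. Only the first 60 and last 54 bases, copied from the alignment, are used here.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the standard table is appropriate for this bacterial gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# Bases 1-60 and 781-834 of the database sequence, as shown in the alignment.
first_60 = "ATGCGACGGATGACGCTGCCGAGTGGGGAGTCCATCCCTGTGCTGGGCCAGGGCACCTGG"
last_54 = "CTGGACCGTGCGTTTCCGCCCCCGAGCGGACCGCGACCACTGGAAATGCTGTGA"

print(translate(first_60))  # MRRMTLPSGESIPVLGQGTW
print(translate(last_54))   # LDRAFPPPSGPRPLEML
```

The two printed fragments correspond to the start and end of the 277-residue protein query (Qy 1 to 277) shown in the protein hits.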



RESULT 2 
ABU22187 

ID ABU22187 standard; protein; 281 AA. 
XX 

AC ABU22187; 
XX 

DT 19-JUN-2003 (first entry) 
XX 

DE Protein encoded by Prokaryotic essential gene #7714. 
XX 

KW Antisense; prokaryotic essential gene; cell proliferation; drug design. 
XX 

OS Burkholderia mallei. 
XX 

PN WO200277183-A2. 
XX 

PD 03-OCT-2002. 
XX 

PF 21-MAR-2002; 2002WO-US009107.
XX

PR 21-MAR-2001; 2001US-00815242.

PR 06-SEP-2001; 2001US-00948993.

PR 25-OCT-2001; 2001US-0342923P.

PR 08-FEB-2002; 2002US-00072851.

PR 06-MAR-2002; 2002US-0362699P.
XX 

PA (ELIT-) ELITRA PHARM INC. 
XX 

PI Wang L, Zamudio C, Malone C, Haselbeck R, Ohlsen KL, Zyskind JW; 

PI Wall D, Trawick JD, Carr GJ, Yamamoto R, Forsyth RA, Xu HH; 

XX 

DR WPI; 2003-029926/02. 

DR N-PSDB; ACA26057. 
XX 

PT New antisense nucleic acids, useful for identifying proteins or screening 

PT for homologous nucleic acids required for cellular proliferation to 

PT isolate candidate molecules for rational drug discovery programs. 
XX 

PS Claim 25; SEQ ID NO 50111; 1766pp; English. 
XX 

CC The invention relates to an isolated nucleic acid comprising any one of 

CC the 6213 antisense sequences given in the specification where expression 

CC of the nucleic acid inhibits proliferation of a cell. Also included are: 

CC (1) a vector comprising a promoter operably linked to the nucleic acid 

CC encoding a polypeptide whose expression is inhibited by the antisense 

CC nucleic acid; (2) a host cell containing the vector; (3) an isolated 

CC polypeptide or its fragment whose expression is inhibited by the 

CC antisense nucleic acid; (4) an antibody capable of specifically binding 

CC the polypeptide; (5) producing the polypeptide; (6) inhibiting cellular 

CC proliferation or the activity of a gene in an operon required for 

CC proliferation; (7) identifying a compound that influences the activity of 

CC the gene product or that has an activity against a biological pathway 

CC required for proliferation, or that inhibits cellular proliferation; (8) 

CC identifying a gene required for cellular proliferation or the biological 

CC pathway in which a proliferation-required gene or its gene product lies 

CC or a gene on which the test compound that inhibits proliferation of an 

CC organism acts; (9) manufacturing an antibiotic; (10) profiling a 



CC compound's activity; (11) a culture comprising strains in which the gene 

CC product is overexpressed or underexpressed; (12) determining the extent 

CC to which each of the strains is present in a culture or collection of 

CC strains; or (13) identifying the target of a compound that inhibits the 

CC proliferation of an organism. The antisense nucleic acids are useful for 

CC identifying proteins or screening for homologous nucleic acids required 

CC for cellular proliferation to isolate candidate molecules for rational 

CC drug discovery programs, or for screening homologous nucleic acids 

CC required for proliferation in cells other than S. aureus, S. typhimurium, 

CC K. pneumoniae or P. aeruginosa. The present sequence is encoded by one of 

CC the target prokaryotic essential genes. Note: The sequence data for this 

CC patent did not form part of the printed specification, but was obtained 

CC in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences
XX 

SQ Sequence 281 AA;

Query Match 55.4%; Score 806; DB 6; Length 281; 

Best Local Similarity 55.2%; Pred. No. 2e-69; 

Matches 153; Conservative 39; Mismatches 85; Indels 0; Gaps 0; 

Qy 1 MRRMTLPSGESIPVLGQGTWGWGEDPGRRGDEVAALHAGLELGMTLVDTAEMYADGGAEE 60 

: : II II H llllll II I :| hill hHlllhllllll II I 
Db 5 LETVALPGGERVPKLGQGTWEMGERPAKRAAEIAALREGVDLGMTLIDTAEMYGDGATET 64 

Qy 61 VAGEALAGRRDEAFVVSKVMPSHASRSGTIAACERSLKRLGTDRIDLYLLHWQGRYPLQD 120

: hill h hMI|:| MM I Mill MMI MhlMMIhl M 

Db 65 LVGDALADVRERLFIVSKVLPHHASRGGWAACEATLKRLRTDRVDLYLLHWRGSIPLAE 124 

Qy 121 TVAAFHQLVEDGKIRYWGVSNFDHRALAELQDVPGTSGLTTDQVLYNLSRRGPEYDLLPW 180 

Ml I I : MM Mill III I I : MhlllhlllMMIIII 

Db 125 TIAGFEALRDAGKIRYWGVSNFDVDDMEALVAEAGGAVCATDQILYNLARRGPEFDLLPW 184 

Qy 181 CADHQLPVMAYSPIEQGRILDDTTLNDVAARHSVSPAAAALAWVLRRDSLCTIPKASSPQ 240 

I ::i Mlllh h i h:M III llllll MM 1 : 

Db 185 LARREMPAIAYSPIDHMRLPKRTALDEIARERGVSPTRVALAWVLGQPNVLAIPKAGSVE 244 

Qy 241 HVRDNATALDVELTREDLDALDRAFPPPSGPRPLEML 277 

Mill MM I hi II I II llllll 

Db 245 HVRDNRAALDLVLGEEELARLDAQFKSPRGKRPLEML 281 
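
The summary figures for this hit appear consistent with "Best Local Similarity" being the number of exact matches divided by the total number of aligned columns (matches + conservative + mismatches + indels): 153 / 277 is about 55.2%. The short Python sketch below checks that reading against the figures reported for the hits in this report. The function name is hypothetical, and the formula is inferred from the numbers shown here, not taken from the search tool's documentation.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent of aligned columns that are exact matches (inferred formula)."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# (matches, conservative, mismatches, indels) -> reported percentage
reported = {
    (834, 0, 0, 0): 100.0,    # BD141394
    (153, 39, 85, 0): 55.2,   # ABU22187
    (158, 44, 70, 6): 56.8,   # US-09-252-991A-29664
    (139, 45, 93, 7): 48.9,   # US-09-489-039A-7896
    (158, 35, 81, 0): 57.7,   # A97604 and AB2826
}

for counts, percent in reported.items():
    computed = round(best_local_similarity(*counts), 1)
    print(counts, computed, percent)
```

Under this assumption every computed value rounds to the percentage printed in the corresponding summary block.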



RESULT 1 

US-09-252-991A-29664 

Sequence 29664, Application US/09252991A 
Patent No. 6551795 
GENERAL INFORMATION: 
APPLICANT: Marc J. Rubenfield et al.

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO PSEUDOMONAS
TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS

FILE REFERENCE: 107196.136

CURRENT APPLICATION NUMBER: US/09/252,991A
CURRENT FILING DATE: 1999-02-18 
PRIOR APPLICATION NUMBER: US 60/074,788 
PRIOR FILING DATE: 1998-02-18 
PRIOR APPLICATION NUMBER: US 60/094,190 
PRIOR FILING DATE: 1998-07-27 
NUMBER OF SEQ ID NOS: 33142
SEQ ID NO 29664 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-29664

Query Match 55.8%; Score 812; DB 4; Length 352; 

Best Local Similarity 56.8%; Pred. No. 4.6e-80; 

Matches 158; Conservative 44; Mismatches 70; Indels 6; Gaps 4 

Qy 1 MRRMTLPSGESIPVLGQGTWGWGEDPGRRGDEVAALHAGLELGMTLVDTAEMYADGGAEE 60 

h :|| h : :|IIM II II I Mill h I I II I h I I I I I I I : I I I I : 
Db 80 MKTLTLIDGQEVAAIGQGTWHIGEQPGERKREVAALREGIELGMTLIDTAEMYAEGGAED 139

Qy 61 VAGEALAGRRDEAFVVSKVMPSHASRSGTIAACERSLKRLGTDRIDLYLLHWQGRYPLQD 120

M hlll|:M:|||| I HIM IMIIIhim lllllllllllllh: 
Db 140 WGAAIAGRREEVFLVSKVYPHNASRKGLPAACERSLRRLGCETIDLYLLHWQGRYPLEE 199 

Qy 121 TVAAFHQLVEDGKIRYWGVSNFDHRALAELQDVPGTSGLTTDQVLYNLSRRGPEYDLLPW 180 

h M :| : III lllllll I I hlhlll II MMIM 

Db 200 TIEAFERLRDQGKILRWGVSNFD LGDMYELDG -SACATNQVMYNLEERG I EYDLLPW 255 

Qy 181 CADHQLPVMAYSPIEQ-GRILDDTTLNDVAARHSVSPAAAALAWVLRRDSLCTIPKASSP 239 

I : HIIII h I Ihl I ::|MI I I lllhl : : MM : 

Db 256 CQERGMPVMAYCPVGQGGRLLRHPALGEIAARHDASSAQVALAWLLEQ-GVIAIPKAVTS 314 

Qy 240 QHVRDNATALDVELTREDLDALDRAFPPPSGPRPLEML 277 

hi M I hlh Ml Mhllllh I I 
Db 315 AHIRQNAAAADLELSADDLRALDQAFPPPTRKRNLAIV 352 



RESULT 2 

US-09-489-039A-7896

Sequence 7896, Application US/09489039A
Patent No. 6610836
GENERAL INFORMATION:
APPLICANT: Gary Breton et al.

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO KLEBSIELLA
TITLE OF INVENTION: PNEUMONIAE FOR DIAGNOSTICS AND THERAPEUTICS
FILE REFERENCE: 2709.2004001
CURRENT APPLICATION NUMBER: US/09/489,039A
CURRENT FILING DATE: 2000-01-27 
PRIOR APPLICATION NUMBER: US 60/117,747 
PRIOR FILING DATE: 1999-01-29 
NUMBER OF SEQ ID NOS: 14342
SEQ ID NO 7896 
LENGTH: 324 
TYPE: PRT 

ORGANISM: Klebsiella pneumoniae 
US-09-489-039A-7896

Query Match 47.2%; Score 686.5; DB 4; Length 324; 

Best Local Similarity 48.9%; Pred. No. 2e-66; 

Matches 139; Conservative 45; Mismatches 93; Indels 7; Gaps 3 

Qy 1 MRRMTLPSGE--SIPVLGQGTWGWGEDPGRRGDEVAALHAGLELGMTLVDTAEMYADGGA 58

n h li ::| :| Ml II U Mill |:: hhMIIIIIIIMI 
Db 41 MVKKTVRFGEQAAVPAIGLGTWYMGEHAAQRQQEVAALRVGIDHGLTVIDTAEMYADGGA 100 

Qy 59 EEVAGEALAGRRDEAFVVSKVMPSHASRSGTIAACERSLKRLGTDRIDLYLLHWQGRYPL 118

Ml hh I II Mill I M III Ihll M MMIMhl II 

Db 101 EEWGQAIRGLRDRWLVSKVYPWHAGKAAMHRACENSLRRLQTDYLDMYLLHWRGDIPL 160 

Qy 119 QDTVAAFHQLVEDGKIRYWGVSNFDHRALAELQDVPGTSGLTTDQVLYNLSRRGPEYDLL 178 

hll I Ml Mill Mill I : I IMIIhh II Mill 

Db 161 QETVEAMEKLVAEGKIRRWGVSNLDTEDMQALWRTADGEHCATNQVLYHLASRGIEYDLL 220 

Qy 179 PWCADHQLPVMAYSPIEQ-GRILD DTTLNDVAARHSVSPAAAALAWVLRRDSLCTI 233 

Ml I llllll h I Ih I : : :M I MUM I 

Db 221 PWCQQHSLPVMAYCPLAQAGRLRDGLFQHSDIINMANARGITVAQLLLAWVIRHPGVLAI 280 

Qy 234 PKASSPQHVRDNATALDVELTREDLDALDRAFPPPSGPRPLEML 277 

lihl :|| II III: h hi III :|ll hh 
Db 281 PKAASIEHWQNAAALDIVLSGEELAQLDRLYPPPQRKTRLDMV 324



RESULT 1 
A97604 

probable oxidoreductase (PA0804) [imported] - Agrobacterium tumefaciens (strain
C58, Cereon)

C;Species: Agrobacterium tumefaciens

C;Date: 30-Sep-2001 #sequence_revision 30-Sep-2001 #text_change 18-Nov-2002
C;Accession: A97604

R;Goodner, B.; Hinkle, G.; Gattung, S.; Miller, N.; Blanchard, M.; Qurollo, B.;
Goldman, B.S.; Cao, Y.; Askenazi, M.; Halling, C.; Mullin, L.; Houmiel, K.;
Gordon, J.; Vaudin, M.; Iartchouk, O.; Epp, A.; Liu, F.; Wollam, C.; Allinger,
M.; Doughty, D.; Scott, C.; Lappas, C.; Markelz, B.; Flanagan, C.; Crowell, C.;
Gurson, J.; Lomo, C.; Sear, C.; Strub, G.; Cielo, C.; Slater, S.
Science 294, 2323-2328, 2001 

A;Title: Genome Sequence of the Plant Pathogen and Biotechnology Agent 
Agrobacterium tumefaciens C58. 

A;Reference number: A97359; MUID:21608551; PMID:11743194
A;Accession: A97604
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-281 <KUR>

A;Cross-references: GB:AE007869; PIDN:AAK87786.1; PID:g15157157; GSPDB:GN00169
C;Genetics:

A;Gene: AGR_C_3682

A;Map position: circular chromosome
C;Superfamily: aldehyde reductase

Query Match 55.7%; Score 810; DB 2; Length 281; 

Best Local Similarity 57.7%; Pred. No. 3.5e-62; 

Matches 158; Conservative 35; Mismatches 81; Indels 0; Gaps 0; 

Qy 4 MTLPSGESIPVLGQGTWGWGEDPGRRGDEVAALHAGLELGMTLVDTAEMYADGGAEEVAG 63 

= 11111 :| II III II \\: :: II I I I I I I I I I I I I I : I I I I 

Db 8 VTLPSGNEVPALGLGTWNMGETRSSADDEIESIRKAIDLGMTLVDTAEMYADGRSEEWG 67 

Qy 64 EALAGRRDEAFVVSKVMPSHASRSGTIAACERSLKRLGTDRIDLYLLHWQGRYPLQDTVA 123

hllllll hllll I :|| II IIIIM Mill lilllllhl Ml MM 

Db 68 TAIAGRRDEVFLVSKVYPWNASARGTAEACERSLARLGTDHIDLYLLHWRGEHPLGETVA 127 

Qy 124 AFHQLVEDGKIRYWGVSNFDHRALAELQDVPGTSGLTTDQVLYNLSRRGPEYDLLPWCAD 183 

11 M MM IlilMI : II II MIIIIIIIIMh Mill = 

Db 128 AFERLKSDGKIGNWGVSNFDTDDMEELFTVPEGKNCAANQVLYNLSRRGPEFSLLPWCQE 187 

Qy 184 HQLPVMAYSPIEQGRILDDTTLNDVAARHSVSPAAAALAWVLRRDSLCTIPKASSPQHVR 243 

1 H:|IIMIIIIIM : I :| : HI MhH II : llhH : 
Db 188 HGVPLMAYSPIEQGRILKNHELIRIAKAYQATPAQLALAFLLDRDGVIAIPKSASVSRIV 247 

Qy 244 DNATALDVELTREDLDALDRAFPPPSGPRPLEML 277 

H I |:|:| II Ml llllh MM 
Db 248 ENRGATDLEITEEDWTALDAAFPPPTRKTSLEML 281 



RESULT 2 
AB2826 

aldo/keto reductase Atu2032 [imported] - Agrobacterium tumefaciens (strain C58,
Dupont)

C;Species: Agrobacterium tumefaciens

C;Date: 11-Jan-2002 #sequence_revision 11-Jan-2002 #text_change 18-Nov-2002

C;Accession: AB2826

R;Wood, D.W.; Setubal, J.C.; Kaul, R.; Monks, D.; Chen, L.; Wood, G.E.; Chen,
Y.; Woo, L.; Kitajima, J.P.; Okura, V.K.; Almeida Jr., N.F.; Zhou, Y.; Bovee
Sr., D.; Chapman, P.; Clendenning, J.; Deatherage, G.; Gillet, W.; Grant, C.;
Guenthner, D.; Kutyavin, T.; Levy, R.; Li, M.; McClelland, E.; Palmieri, A.;
Raymond, C.; Rouse, G.; Saenphimmachak, C.; Wu, Z.; Gordon, D.; Eisen, J.A.;
Paulsen, I.; Karp, P.; Romero, P.; Zhang, S.
Science 294, 2317-2323, 2001

A;Authors: Yoo, H.; Tao, Y.; Biddle, P.; Jung, M.; Krespan, W.; Perry, M.;
Gordon-Kamm, B.; Liao, L.; Kim, S.; Hendrick, C.; Zhao, Z.; Dolan, M.; Tingey,
S.V.; Tomb, J.; Gordon, M.P.; Olson, M.V.; Nester, E.W.

A;Title: The Genome of the Natural Genetic Engineer Agrobacterium tumefaciens
C58.

A;Reference number: AB2577; MUID:21608550; PMID:11743193
A;Accession: AB2826
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-281 <KUR>

A;Cross-references: GB:AE008688; PIDN:AAL43024.1; PID:g17740488; GSPDB:GN00186
A;Experimental source: strain C58 (Dupont)
C;Genetics:
A;Gene: Atu2032

A;Map position: circular chromosome
C;Superfamily: aldehyde reductase

Query Match 55.7%; Score 810; DB 2; Length 281; 

Best Local Similarity 57.7%; Pred. No. 3.5e-62; 

Matches 158; Conservative 35; Mismatches 81; Indels 0; Gaps 0; 

Qy 4 MTLPSGESIPVLGQGTWGWGEDPGRRGDEVAALHAGLELGMTLVDTAEMYADGGAEEVAG 63

HUM :| II III II Ih I I I II I I I I I I I M I : I I II

Db 8 VTLPSGNEVPALGLGTWNMGETRSSADDEIESIRKAIDLGMTLVDTAEMYADGRSEEWG 67

Qy 64 EALAGRRDEAFVVSKVMPSHASRSGTIAACERSLKRLGTDRIDLYLLHWQGRYPLQDTVA 123

hlMMI hllll I :|| II MINI Mill lllllllhl Ml MM

Db 68 TAIAGRRDEVFLVSKVYPWNASARGTAEACERSLARLGTDHIDLYLLHWRGEHPLGETVA 127

Qy 124 AFHQLVEDGKIRYWGVSNFDHRALAELQDVPGTSGLTTDQVLYNLSRRGPEYDLLPWCAD 183

II M MM lllllll : II M Mllllllllllh Mill :

Db 128 AFERLKSDGKIGNWGVSNFDTDDMEELFTVPEGKNCAANQVLYNLSRRGPEFSLLPWCQE 187

Qy 184 HQLPVMAYSPIEQGRILDDTTLNDVAARHSVSPAAAALAWVLRRDSLCTIPKASSPQHVR 243
nhlllll.lllllll : I .-I : :|L II : |||::| :

Db 188 HGVPLMAYSPIEQGRILKNHELIRIAKAYQATPAQLALAFLLDRDGVIAIPKSASVSRIV 247

Qy 244 DNATALDVELTREDLDALDRAFPPPSGPRPLEML 277

= 1 I hhl II III llllh MM

Db 248



